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Ve  treat  linear  model)Y^  =  c|  where  is  a  known  vector,  (3  is  an 

_  ^  J 

unknown  parameter,  and  Var  Z.  is  a  function  of  |C:  8|  which  is  known,  except 

"  r,  0  ^  1  ~l  ~ 


^  for  a  parameter  0.  For  simultaneous  M-estimates,  and- Qy^we^, show  that 

(0-0)  =  Op(N  ^),  and*  find  the  limit  distribution ^>f^l(8-8)^*  For  the  special 
case  of  least  squares  estimation,  this  limit  distribution  is  the  same  as  the 
limit  distribution  of  the  weighted  least  squares, jusing  the  weights, 
w^=  (Var  Z^)  1 ,* and  in  general  the  distribution  is  that  of  a  ""weighted 
M- estimate**' using  these  weights.  Moreover,  the  covariance  matrix  of  the 
limit  distribution  can  be  consistently  estimated,  so  large  sample  confidence 
ellipsoids  and  tests  of  hypotheses  concerning  £  are  feasible. 
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1.  Introduction. 


Consider  the  linear  model 


Y . =  t.  +  a.e. 
11  i  i 


and 

Ti=  for  *  =  1,...,N, 

where  e.,...,eM  are  i.i.d.  with  distribution  F.C.(=C..,)  are  known  elements  in 
R*5,  B  is  a  vector  parameter  in  R*5,  and  a  ,  ...,ctn  are  constants,  which  express 
the  possible  heteroscedasticity  of  the  model.  If  F  were  a  normal  distribution 
function  and  the  o.  were  known  then  the  minimum  variance  unbiased  estimator 

A 

of  3  would  be  the  weighted  least  squares  estimator  (L.,  which  is  the  solution  to 

N  x  A 

(1.1)  l  P(  (Y.  -C.  3J/o )  =  minimum, 

i_l  1  1  -w  1 


2 

where  p(x)  =  x  .  If  F  has  heavy  tails  compared  with  the  Gaussian  distribution 

or  gross  errors  are  possibly  present  in  the  data,  then  one  may  wish  to  replace 
2 

p(x)  =  x  with  a  function  such  that  ^  =  p’  is  bounded.  Such  estimators  have 
been  called  M-estimators  by  Huber  (1964,  1973),  because  they  are  generali¬ 
zations  of  maximum  likelihood  estimators.  If  the  are  not  known  it  may 
still  be  possible  to  estimate  them.  For  example.  Fuller  and  Rao  (1978)  con¬ 
sider  the  case  where  the  Y^  occur  in  groups  for  which  cr  is  constant,  and 
Box  and  Hill  (1974)  assume  that 


(1.2)  0^9)  =  o.  =  o|t.|9 

for  an  unknown  parameter  0  and  estimate  both  0  and  £  by  Bayesian  methods. 

Both  Fuller  and  Rao  and  Box  and  Hill  treat  only  Gaussian  errors. 

Anscombe  (1961)  has  proposed  tests  of  heteroscedasticity,  and  Bickel 
(1978)  has  developed  robust  versions  of  these,  but  neither  has  considered 
modifying  the  estimate  of  3  if  the  null  hypothesis  of  homoscedasticity  is  rejected 
In  many  empirical  studies,  one  finds  that  the  dispersion  of  the  residuals 


ortCIAL 


□  □ 
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increases  with  the  magnitude  of  the  fitted  values,  so  that  it  is  reasonable 
to  assume  that  =  o(|x.J)  for  some  nondecreasing  function  o.  While  the 
power  family  (1.2)  may  be  sufficiently  rich  to  model  this  phenomenon  (although 
it  has  obvious  difficulties  in  practice  when  x^=  0) ,  we  prefer  to  study  the 
general  class  , 

(1.3)  o.  =  exp(h(x.)T  6), 

where  0  is  a  parameter  in  Rl*  and  h  is  a  known  function  from  R*  to  . 

Notice  that  choosing  h^x^)  =  1  (h^  is  the  first  coordinate  of  h ) ,  expfO^) 
becomes  a  scale  parameter  ;  as  a  result,  our  estimate  of  (3  will  be  scale 
equivariant  regardless  of  the  choice  of  p.  Two  choices  of  h  which  have  no 
practical  difficulties  when  x^=  0  are  h(x)  =  (1,  log ( 1 + | T | ) )  and  h(x)  =  (l,|x|). 
To  motivate  our  method,  suppose  F  is  standard  normal  so  that  the  log- 

likelihood  is  .  N  2  . 

L(Y;0,B)  =  -  j  l  {log  2 it  ♦  log  0  ♦  ((Y  -X  )/o  T)  . 

i=l  1  iii 

If  0_  were  known,  (1.1)  would  yield  the  MLE  for  |3,  while  if  f?  were  known,  the 
MLE  of  0^  would  solve 

N  7 

(1.4)  l  ( ( (Y . -X . )/o. (0)  j  -  1}  h(x  )  =  0. 

i=l  1  1  1  1 

A  reasonable  computational  alternative  to  solving  (1.1)  and  (1.4)  simultaneous¬ 
ly  might  consist  of  (i)  obtaining  a  preliminary  estimate  of  B  (such  as  the  least 
squares  estimate)  and  hence  estimates  for  T^»'**>Tn  (ii)  solve  (1.4)  using 
these  estimates,  thus  obtaining  estimates  of  Oj,...,o  ,  which  (iii)  are  used 
to  solve  (1.1). 

We  thus  suggest  the  following  procedure.  First,  a  preliminary  estimate 

/\ 

Sq  of  3  is  calculated  and  assumed  to  satisfy 

(Bq-  3)  -  Op(N^). 


(1.5) 
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Examples  of  estimates  satisfying  (1.5)  are  given  in  Section  5.  At  the  second 

T  A 

stage,  define  t.  =  C.  0.  and  obtain  robustified  estimates  of  0  by  solving  the 

l—l—O  — 

following  analogue  of  (1.4): 


(1.6) 


l  ty2((Y.-t.)  exp(-h(ti)T  0))  -  O  h(ti)  =  0, 
i=l 


2 

where  ip  is  monotone  nondecreasing  and  E,  =  E<p  (^),  the  expectation  being  taken 
under  the  standard  normal  distribution.  Clearly,  i|>(x)  =  x  leads  to  (1.4). 

At  the  third  stage,  we  now  solve  a  robust  version  of  (1.1): 


(1.7) 


where 


(1.8)  Oj  =  exp(-h(ti)T  0). 

Our  main  result  (Theorem  4.1)  lists  conditions  under  which  the  limit  dis- 

A 

tribution  of  3  defined  by  (1.5) -(1.8)  is  the  same  as  that  of  the  estimate  which 
could  be  found  by  solving  (1.7)  when  are  known.  In  Section  2  we 

introduce  notation  and  assumptions.  Section  3  demonstrates  that 


(1.9)  0  -  0  =  0p(N-*)  . 

In  Section  4  we  state  and  prove  the  main  result. 


Remarks. 

A 

A 

A  If  one  assumes  homoscedasticity  (o^  =  o),  then  h(T^)  *  1.  The  estimate  be¬ 
comes  an  ordinary  robust  regression  estimate  with  preliminary  estimate  of 
scale  given  by  (1.5)- (1.6).  See  Maronna  and  Yohai  (1979)  for  further  details. 

B  We  do  not  know  if  iterating  (1.5)- (1.8)  will  lead  to  convergence.  Further,  we 
do  not  know  whether  simultaneous  solution  of  (1.6)- (1.7)  is  possible. 
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C  Throughout  the  paper,  we  will  use  the  following  convention.  For  a  real  function 
f  on  a  space  X,  we  say  that  x^  solves  f(x)  =0  if 

(1.10)  |f(xn)|  <  2  inf  |f (x) | . 

With  this  convention,  all  our  estimators  will  exist  but  need  not  be  unique. 
However,  our  asymptotic  results  hold  for  every  appropriate  sequence  of  esti¬ 
mators  . 

D  The  Box  and  Cox  (1964)  transformations  form  an  alternative  method  for  dealing 
with  heteroscedasticity,  as  well  as  other  deviations  from  the  normal  linear 
model.  Defining 

Y(X)  =  (YX-l)/\  (A  t  0) 

=  log  Y  (X  =  0), 

they  postulate  that  for  some  X,  Y^  satisfies  a  homoscedastic  normal  linear 
model.  Their  methodology  is  based  upon  an  entirely  different  model  from  ours, 
but  a  practitioner  might  consider  using  both  on  a  given  data  set.  Carroll 
(1978)  and  Bickel  and  Doksum  (1978)  independently  studied  both  Box-Cox  and 
robustified  Box-Cox  methods  and  concluded  that  variances  of  the  estimated  coef¬ 
ficients  in  the  linear  model  are  often  much  larger  when  X  is  estimated  than 
when  it  is  known.  Therefore,  confidence  intervals  and  tests  of  hypotheses  which 
are  constructed  as  if  A  were  known  and  not  estimated  are  invalid. 

In  contrast,  we  show  in  Section  4  that,  if  the  errors,  e^,  are  symmetrically 
distributed  or  ip (x)  =  x,  then  for  our  method,  the  variances  of  estimated  coef¬ 
ficients  when  0  is  estimated  are  similar  to  those  when  0  is  known,  and  confidence 
intervals  and  tests  can  be  validly  constructed  as  if  0  were  known. 
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2.  Assumptions  and  notations.  Let  ^  be  a  nondecreasing  function  from 
R1  to  R*  which  satisfies 

Bl.  EiKej)  =  0, 

B2.  0  <  E4»2(ei)  =  C  < 

for  some  a  >  0 

B3.  E<M(e1+r)(l+s))  =  A (*)  r  +  0(|r|1+a  4  |s|1+a) 

as  r  +  0  and  s  -*•  0  with  A(<Jj)  >  0, 

B4 .  E^2((e1+r)(l4s))  -  5  =  A (i|>2)  s  4  O(|r|1+0t  4  |s|1+0t) 

2 

as  r  -*■  0  and  s  0  with  A(i|;  )  >0, 

B5.  there  exists  Cq  such  that  for  all  6  <  1 

lim  sup  E{sup|4>((e.+r)  (1+s))  -  <}>( (e.+r')  (l4s  '))  | : 
k-K) 


BIO.  h  is  Lipschitz  continuous  on  an  interval  (possibly  infinite),  I,  such 

that  T.  is  in  I  for  all  i  and  N, 
i 


page 


Bll. 

and 

B12. 


inf  inf  h(x.)^  0  >  -<*> 

N  i<N  1 

letting  tL  =  exp(-h(x^)T  0), 


l  N  T 

’l  I  d.dT  = 
i=l 


S..  -  S 


(positive  definite). 


Remark .  The  monotonicity  of  ij;  is  needed  only  to  prove  -  consistency 

A  A 

of  and  £. 

The  conditions  B1-B6  are  notationally  complex  but  widely  applicable. 

2  4 

They  clearly  hold  for  p(x)  =  x  /2  (^(x)  =  x)  if  Ee^  0,  Ee^  <  °°,  where 
2  2 

A  (ip)  =  1,  A(i|>  )  =  2Ee  j  .  If  F  is  symmetric  and  ip  is  odd,  B1-B6  can  be  veri¬ 
fied  if  (i)  ip  is  constant  outside  an  interval  (ii)  ip  is  Lipschitz  continuous 
and  twice  boundedly  differentiable  except  possibly  at  a  finite  number  of  points, 

aj,...,a^  (iii)  F  is  Lipschitz  continuous  in  neighborhoods  of  a^,...,a^  and 
2 

(iv)  Eip  ( e p,  Ei/j' (ep  and  Ee^fe^)  ^'(ep  are  all  positive.  Then  A(^)  =  E^' (e^ 
and  A(ip2)  =  2Ee1</<(e1)  ^ '  (e1 ) . 

For  the  power  model  (1.2),  h(x)  =  (1,  log  | x |  )^ 0_  =  (log  o,0)  and  BIO,  Bll 
hold  if 

(2.1)  inf  inf  |x. |  >  0. 

N  l<i<N  1 

Since  the  power  model  (1.2)  makes  little  sense  when  x^  =  0,  (2.1)  is  reasonable. 
To  avoid  this  difficulty,  in  Section  1  we  suggested  h^(x)  =  (1,  log(l+|x|)),  for 
which  BIO,  Bll  hold.  The  other  example  h(x)  =  (l,|x|)  will  satisfy  BIO,  Bll 
as  long  as  the  (x ^ }  are  uniformly  bounded. 
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there  exists  >^>0  such  that 


|<B> 
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(3.6)  pf  inf  I  I WN (A)  |  |  >  Q' 

I  I  A.  I  I  <M.  N  s  1  -  e 

1i|A2ii^m2 

y  ' 

We  will  prove  (3.6)  by  modifying  the  proof  of  Jureckova's  (1977)  Lemma  5.2. 
We  first  apply  Theorem  A.l  of  the  appendix,  with  =  h  (0)  and 

aP^  (A)  =  h ^  (A)  -  hu  (0) .  Then  B1-B8,  BIO,  Bll  imply  that  the  assumptions  of 

2 

Theorem  A.l  are  met  with  g^=  1,  A (<J>,  i)  =  A(ip  ),  so  that  for  all  M>0, 

sup  W  (A)  -  W  (0)  -  AM2)N‘%  l  h(x .)a(1)(A) 

1|A||<M  N  "  N  "  i=l  ~  1  1  “ 

= 

By  a  Taylor  series  expansion, 

aPV)  =  -N‘%h(x.)T  A2  ♦  (h.(0)-h.(A))T  0  +  op(N'%). 

Thus,  by  B9,  setting 

i  N  T 

GN(A)  =  N‘?  l  h(T  )(h  (0)-h  (A))1  0, 
in  iml  1  1  1 

(3.7)  sup  ||W (A)  -  W  (0)  -  A(^2)HA  +  G  (A) | |  =  o  (1). 

||a||<m  p 

Now  fix  e>0,  Mj>0,  Q>0 .  Use  (3.5)  to  choose  y  such  that 
P ( 1 1 WN ( 0) I  I  *  Y/2)  <  e/2. 

Define 

D  =  sup  sup  | | G„ (A) ( | . 

N  llAjllsMj 

Then  D<°°  (G^  depends  only  on  A^).  Define  M2  by 
tA(^2)XooM2/2  -  y  -  D]  =  Q. 

Using  B9  and  (3.7),  find  Nfl  such  that  XN  >  \J2  and 

pf  sun  ||WN(A)  -  WN(0)  -  A(i|;2)HA2-  Gn(A)  |  |  <  J 

I  I A  1  |=M 

*  1  -  e/2  (N2NQ). 
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If  | | A2 | |  =  M2,  | | Aj | |  <  Mj,  and  N  2  NQ,  then  with  probability  at  least  1-e, 


A2WNCA) 

>  -M2|  |WnCC0  |  |  +  ^2  HA2A(ip2)  -  M2D  -  M2Y/2 
2  [A(i^2)  M2/2  -  Y  -  D]M2  =  QM2. 

Since  is  nondecreasing,  A2W  (A  ,A2s)  is  a  nondecreasing  function  of  s.  Thus, 
| |A2 | |  >  M2  implies 

a2wn(A)  >  a2wn(A1,m2a2| |a2| I'1) 

>(|  |A2|  |/M2)(M2A2|  |A2H'1Wn(A1,M2A2|  |A2|  I"1)) 


*  l|A2l|Q 

Thus, 

p  inf 

I  I A  I  I <M 
i I A2 I  I -M2 


a2wn(A) 

Ti^rr 


>  1-E 


which  with  the  Cauchy-Schwarz  inequality  proves  (3.2). 


□ 


4.  Estimation  of  jj.  The  limiting  distribution  of  B  is  a  simple  consequence 
of  the  following  representation. 

* 

Theorem  4.1.  Suppose  B1  to  B12  hold  and  A(i|0  >  0.  Then  if  B  is  any  solu¬ 
tion  (see  (1.10))  to 
N 

(4.1)  l  *((Yi-C?’  B)  exp(-h(t  )T  0)C.  exp^Mtj)1  9)  =  0 
i=l 

with  t.  =  cT  6^,  (Bq-B) =  0  (N  ^),  and  6^  satisfying  (3.2),  then 

(4.2)  N%(B-B)  =  N'V  l  S'ViMe.HAW))'1  ♦  on(l). 

i=l  1  p 

This  result,  strengthened  so  that  the  remainder  is  o(l)  almost  surely,  has  been 
established  in  the  homoscedastic  case  by  Carroll  and  Ruppert  (1979). 
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Proof.  For  A^  and  in  Rp,  A^  in  R^  and  A  =  (A^.A^.A  ),  define 

a?1}(A)  =  N'%  dT  A 
1  —l—l 

hj_  (A)  =  h(T.  ♦  C*  A3  N_%), 

a|2) (A)  =  exp(-hi (A)T  A2  N‘%+  (hi(0)-hi(A))T9)  -  1, 

and 

aP^(A)  =  d.  af2)(A). 

-l  —  -l  l  - 

Define  the  process  N 

U  (A)  =  N'%  E  'K(e.+aJ1)(A))(l+ap)(A))(di+a{3)(A)). 

N  —  i  =  l 

Note  that  (4.1)  can  be  rewritten  as 
(4*3)  Un(N%(6-B),  N%(0-0),  N% (Bq-6) )  =  0. 

Letting  g^  1 ,4>i  (ei>  r,s )  =  ^  C  (e±+  r)  (1  +  s)),  and  A(4>,i)  =  A(4>),  the  conditions 
of  Theorem  A.l  are  implied  by  Bl,  B3,  B5  to  B8,  Bll,  and  B12,  so  for  all  M>0 

(4.4)  sup  ||U  (A)  -  U  (0)  -  A(<|>)  SA  ||  =  o  (1). 

| | A | | <M  P 

Now  by  Chebyshev's  theorem  and  Bl, 

uN(0)  =  0p(l). 

Therefore,  if  we  set 

A*  =  -(A(^)S)-1  UN(0),  then 
A*  =  0p(l) 

and  therefore,  by  (4.4) 

U(A*)  =  op(l). 
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Consequently,  by  (4.3)  and  our  convention  (1.10), 

(4.5)  Un(N%(3-6),  N%(0-6),  N%(S0-6))  =  op(l). 

By  Theorem  3.1,  (6.-6.)  =  0  (N-*),  so  we  need  onl/  establish  that 

(4.6)  (6-3)  =  0p(N~V) 

to  conclude  from  (4.3)  and  (4.4)  that  (4.2)  holds.  But  by  (4.5),  (4.6)  holds 
if  for  each  n>0,  e>0,  and  Mj  there  exists  M2  satisfying 


P(  inf  inf  inf  ||U  (A)||>n)  >  1  -  e 

1 1 Aq 1 1 -M2  II42INmi  M43INm 


Now  (4.7)  follows  from  (4.4)  in  a  manner  quite  similar  to  Jureckova's  (1977) 

proof  of  her  Lemma  5.2.  ^ 

2 

Corollary  4.2.  If  E^  (e^)  <  00  and  the  assumptions  of  Theorem  4.1  are  met, 
then 

N%(0-0)  -  N (0,  S'W(ej)  AW-2)  . 


Proof.  Use  (4.2)  and  Loeve’s  (1963,  p  316)  Normal  Convergence  Criterion 
to  show  that  N*(0-0)Tx  +  n(0,  xTS_1  x  E*2(e1)  A(<)0"2  )  for  each  x  in  RP. 

Since  B7,  Bll  and  B12  show  that 


it  is  easy  to  see  that  the  criterion  is  met. 

Recall  that  A(<|>)  =  Eip’  (e^,  for  commonly  used  ip. 

Define 

A,  T  A  T  A 

oi  =  exp(h(CT  jSq)  0) 

a  a  a  sind 

(one  could  use  0  instead  of  0^) .  Then  when  (0-60  =  0p(N 

=  0p(N‘^),  B7  and  BIO  imply 

sup  |o.-o.|  =  o(l). 

•5  cM  F 
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Suppose  for  both  <p  =  \p  and  <|>  =  ip' 

lim  E  sup{  |<}>((l+x)  (e.+y))  -  4> (e1 )  | :  | x |  s  £  and  |y|  <  e}  =  0. 
e-K) 

Then  it  is  easy  to  prove  that 

In'1  I  (McrT1  (Y.-cT  S))  -  <J)(e.)  |  =  0  (1), 
i=l  F 

and  by  the  strong  law  of  large  numbers, 

1  N 

N  I  <Ke.)  ■*  E<Ke  ). 


Moreover, 


|  |N-‘  I  a'2  C.CT  -  N"1  2  d.dT|| 

11  .L.  1  —1-1  .  L,  -1-1  1  ' 

1=1  1=1 

A  _1  N  2 

<  sup  |a.-(T  I  (N  l  |  | Ci  I  I  )  =  0  (1) 
i<N  i=l  F 


(with  | | A | |  equal  to,  say,  the  Euclidean  norm  of  the  matrix  A), 
so 


i=l 


t  P 

C.Cl  ->■  S. 

—l—i 


2  -  2 

Therefore,  E<|>  (e^  A(ij>)  S  can  be  consistently  estimated,  and  large  sample 
confidence  ellipsoids  and  tests  of  hypotheses  for  £  can  be  constructed. 


5.  AT  -  consistency  of  preliminary  estimators.  Since  we  require  that  our  pre¬ 
liminary  estimate  satisfy  (1.5),  we  now  give  conditions  which  insure  that 

A 

an  M-estimate  0^  solving 

!  *(v£  V  Si  ■ 0 

i=l 

A 

satisfy  (1.5).  This  0^  is  not  scale  equivariant,  but  this  does  not  affect 

A 

the  limit  distribution  of  3. 


B13.  ip  is  odd 


-c 
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B14.  i|>  is  Lipschitz  continuous  with  Lipschitz  constant 

B15.  the  Radon-Nikodyn  derivative  i|>'  satisfies  E0|>(x(ej+y))  -  tKxe^  - 


(xepxy)  <  K|xy|*+0t  for  some  K  and  a>0  and  all  x  and  y 


B16.  N'%(a.+  1)|  |d.| 


B17.  sup (N~ 1  l  (o?+  1) I |d. | | 2)  < 
N  i=l 


B18.  the  minimum  eigenvalue,  A^,  of 


■  1  1  7 
N  l  o.  ©p^o.ej)  d.df 

i=l 


satisfies  lim  inf  A.,  >  0. 

N-x»  N 

Theorem  5.1.  Under  B13  to  B18,  (1.9)  holds. 

Proof.  We  will  apply  Theorem  A.l  with  <t>.(ej,r,s)  =  ^(c^  (ej-r))  (so  4^  does 
not  depend  on  s)  ,  g.=  o.,  k^  N"^|JCi||,  A(>M)  =  oi  Ei/fa^),  cxP^E  0, 

(2) 

and  ’  left  undefined.  By  B13  and  B14,  (A. 3)  holds.  By  B15,  for  all  r  and  r' 
|4>i(e1,r)  -  (^.(e^r')  |<  K  o|r-r'| 

so  that  (A. 4)  and  (A. 5)  hold.  Also,  (A. 5)  is  implied  by  B15,  and  (A.l),  (A. 2), 
(A. 7),  (A. 8),  and  (A. 9)  are  easily  checked.  Therefore,  for  all  M>0, 


sup  ||N'^  l  <Jj(o ■  (e.-d  A)  -  N  *  l  <|>(a.e .) 
|  |  A  |  |  <M  i=l  1  1  i  i=l  11 


+  N'1  y  a.  E^(o.e.)  d.dT  A|  |  =  0  (1). 

. L,  i  i  1  -l-i  11  p 

i=l 


Now  (5.1)  follows  exactly  as  Jureckova's  (1977)  Lemma  5.2,  since 

'T  A  T  A 

V  S  So  =  W  SiCfio-fi))- 


□ 
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APPENDIX 


The  following  general  theorem  will  be  used  when  studying  0^,  0  and  $. 

Theorem  A.  1 .  Let  g^  and  (=g.j^  an d  k^)  be  sequences  of  positive  constants 
such  that 


(A. 1)  lim  (sup  k  +  k.g.))  =  0 
i<N 

and 

N  y  7  7 

(A. 2)  sup  (  l  kf  ♦  kfgp  =  c  <  »  . 

N  i=l  1  11  1 

Let  <J>  be  a  function  from  R3  to  R*  satisfying 

(A. 3)  E«j»i(e1,0,0)  =  0  for  all  i, 

(A. 4)  lim  sup  E{sup|<J>.  (e  ,r,s  )  -  <J> .  (e. , r', s '  )  | : 

k-*0  1 

M»  lr'  I.  Is  |,  Is'  |  s  k  and  |r-r'|,  |s_s.  |  <  k6>  <  C06g. 
for  some  Cq  and  all  6<1  and  all  i, 

(A. 5)  sup  gT1  E(<He1,r,  s)  -  $^,0,0)  -  A(<t>,i)r)  =  O(|r|1+0t  +  |s|1+0t) 
i<N 

for  positive  constants  A(<J),i)  and  a>0,  and 

(A. 6)  lim  sup  E(4>.  (e  ,r,s  )  -  <|>.(e  ,0,0))2  =  0. 
r,  s-K)  i<N 

Let  ct^,  and  af3^  be  functions  from  Rm  to  R*,  R1  and  Rn,  respectively, 

such  that 

(A. 7)  a^(0)  =  0,  and 

(A. 8)  ||aj£)(x)  -  a.(£,)(y)||  <  kjlx-^l 

for  all  x  and  y  in  Rm,  i=l,...,N,  and  S.=l,2,  or  3.  Let  x^  (=x.N)  elements 

of  Rn  satisfying 

(A. 9)  N'*||xi||  S  k.  . 
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For  A  e  R  ,  define  the  process 

N 

Un(A)  =  N‘%  l  ♦i(ei,o{1)(A),  aPcAJJfx.+a^fA)) 
i=l 


Then,  for  all  M>0 


ulO)  sup  ||U  (A)  -  U  (0)  -  N'*  l  A($,i)a.(1)(A)x.  ||  *o  (1). 
||A||<M  N  N  i=l  P 


Proof  of  Theorem  A.l. 


Fix  M>0.  We  will  show  that 


(A. 11)  E(Un(A)  -  UN(0))  =  N~*  ^  A(<t>,i)o^1J  (A)  x.+  op(l) 


(A. 12)  Un(A)  -  UNC0)  -  E(Un(A)  -  UN(0))  =  op(l) 

for  each  fixed  A,  and  that  there  exists  K  depending  upon  M  but  not  6  such 
that  for  all  6>0  and  all  N 

(A. 13)  E  sup{| | UN (A)  -  Un(A*)| |:  ||A||<M,  ||A*||^M,  | | A-A* | |<  6}  <  K6. 

Since  for  any  6,  we  can  cover  a  ball  of  radius  M  in  Rm  with  a  finite  number 
of  balls  of  radius  6,  (A. 11),  (A. 12),  and  (A. 13)  prove  that  for  each  6>0, 

lim  p  sup  ||Um(A)  -  U  (0)  -  l  A(4>, i)aP }  (A)x.  |  |  <  K<S  =  1, 
|  |  A  |  |  £  M  N-  N-  i  =  1 

which  proves  the  theorem. 

To  prove  (A. 11),  note  that  by  (A. 3), 
i  N 

n-*  l  E(UN(A)  -  UN(0))  = 
i=l 

N-*  l  E(4..(ei,ap)(A),  aJ2)(A))  -  <t>.(e.,0,0))  (x^  af3)(A)), 
i  =  l 

by  (A.l)  and  (A. 8). 

(A.  14)  |  |x^+  af3)(A)||  <  211^11  for  all  large  N,  and  by  (A. 5), 

EWi(e.,a[1)(A),  a<2)(A))  -  ^(e^O.O))  = 

♦  OCg-k.llAH)140* 
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uniformly  in  i.  Therefore 

E((Un(A)  -  UN(0))  =  N"*  ^  AMyJa^Vjx. 

+  0(N‘%  l  (g.k  )1+0t  |  |x.  ||)f 
i=l  1  1  1 

and  by  (A.  2)  and  (A.  9), 

N'*J1(8iki)1™lliill  s  8^°  .I^A2  " 

S  (giki)2  (  l  g-k2)%(  I  k-)%  =  0(1). 

i=l  i=l 

Thus  (A. 1)  holds. 

Next,  by  (A. 14)  we  have  that  for  N  large 


Var(UN(A)  -  UN(0))  < 

N 

(2N_1  l  g2||x  ||2)  sup  g:2E(Ue  a.(2)(A))  -  <j>  (e  ,0,0))2. 

i=l  1  1  i<N  1  1  1  i  11 

It  follows  from  (A. 4),  (A. 5),  (A. 7),  and  (A. 8)  that 

sup  gj2  E(<J>.(e  ,af1}(A),  a{2)(A))  -  <J>.(e  ,0,0))2  =  o(l). 
i<N 

Therefore,  (A. 12)  is  proved  by  applying  (A. 2).  Finally,  by  (A. 14)  the  RHS  of 
(A. 13)  is  less  than  or  equal  to 

2N~^  l  |  | x i  I  I  E  Sup(|4>i(ei,a|1-)(A),  af2^(A))  - 
i=l 

^.(ei,ai(1)(A*),  aJ2)(A*)|:  ||A||<;M,  ||A||<M,  | |A-A*| |s  6} 
which  by  (A. 2),  (A. 4)  to  (A. 9),  and  the  Cauchy- Schwarz  inequality  does  not  exceed 

N  9  k  i 

sup  2(  [  (g.k  n*  Cn6  =  2C_C*  6. 

N  i=l  1  1  u  u  1 

Therefore  (A. 13)  is  verified.  0 
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